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THE PATH TO RECENT PROGRESS ON SMALL GAPS 
BETWEEN PRIMES 

D. A. GOLDSTON, J. PINTZ AND C. Y. YILDIRIM 


1. Introduction 


In the articles Primes in Tuples I & II ([13], [14]) we have presented the proofs 
of some assertions about the existence of small gaps between prime numbers which 
go beyond the hitherto established results. Our method depends on tuple approxi¬ 
mations. However, the approximations and the way of applying the approximations 
has changed over time, and some comments in this paper may provide insight as 
to the development of our work. 

First, here is a short narration of our results. Let 


(1) 

»(„) 

if n 

is prime, 

^ ' |o 

otherwise, 

and 




(2) 

e{N;q,a) := 

E 

e{n). 


n<N 

n=a ( mod q) 


In this paper N will always be a large integer, p will denote a prime number, and 
Pn will denote the n-th prime. The prime number theorem says that 


( 3 ) 


lim 

X —>^oo 


\{p: p< xjl 

X 

log X 


= 1 , 


and this can also be expressed as 


(o E Bin) ~ x as X ^ oo. 

n<x 

It follows trivially from the prime number theorem that 

(5) liminf ^"+^~^" < 1. 

ra^oo log Pn 

By combining former methods with a construction of certain (rather sparsely dis¬ 
tributed) intervals which contain more primes than the expected number by a factor 
of e'*', Maier [25] had reached the best known result in this direction that 


(6) lim inf —^< 0.24846... . 

n^oo log Pn 
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It is natural to expect that modulo q the primes would be almost equally dis¬ 
tributed in the reduced residue classes. The deepest knowledge on primes which 
plays a role in our method concerns a measure of the distribution of primes in re¬ 
duced residue classes referred to as the level of distribution of primes in arithmetic 
progressions. We say that the primes have level of distribution a if 


(7) 


> max 

a 

q<Q = l 


0(iV;9,a) 


N N 

Hq) ^ (logfV)'^ 


holds for any A > 0 and any arbitrarily small fixed e > 0 with 


(8) Q = 

The Bomhieri- Vinogradov theorem provides the level ^, while the Elliott-Halherstam 
conjecture asserts that the primes have level of distribution 1. 

The Bombieri-Vinogradov theorem allows taking Q = (\ogN)~^^^'> in (7), 

by virtue of which we have proved unconditionally in [13] that for any fixed r > 1, 


(9) lim inf < (^/F - 1)" ; 

n^oo logPn 

in particular, 

(10) liminf ^"+^~^" =0. 

n^oo logPn 

In fact, assuming that the level of distribution of primes is a, we obtain more 
generally than (9) that, for r > 2, 


(11) lim inf 7*"+’' —^ 

n^oo \0gPn 

Furthermore, assuming that a > ^, there exists an explicitly calculable constant 
C{a) such that for k > C{a) any sequence of fc-tuples 

( 12 ) {{nhi,nh2, ■ ■ ■ ,n 

with the set of distinct integers H = {hi,h 2 , ■ ■. ,hk} admissible in the sense that 

k 

]^(n -|- hi) has no fixed prime factor for every n, contains at least two primes 

i=l 

infinitely often. For instance if a > 0.971, then this holds for fc > 6, giving 

(13) liminf(p„+i -p„) < 16, 

n—»-oo 

in view of the shortest admissible 6-tuple (n, n -I- 4, n -|- 6, n -I- 10, n -I- 12, n-\- 16). 

We note that the gaps obeying Eq.s (9)-(ll) constitute a positive proportion of 
all gaps of the corresponding kind. By incorporating Maier’s method into ours we 
improved (9) to 


(14) lim inf 7'"+’' —^<g 

n^oo logPn 

but for these gaps we don’t have a proof of there being a positive proportion of all 
gaps of this kind. (These results will appear in forthcoming articles). 

In [14] the result (10) was considerably improved to 


(15) 


lim inf 

n—^oo 


Pn+1 Pn 

(logp„)5(loglogp„)2 


< oo. 
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In fact, the methods of [14] lead to a much more general result: When ^ C N is a 
sequence satisfying A{N) := \{n;n < N,n G ^}| > (^(log A^)^/^(loglog for all 
sufficiently large N, infinitely many of the differences of two elements of A can be 
expressed as the difference of two primes. 


2. Former approximations by truncated divisor sums 


The von Mangoldt function 


(16) 


A(n) 


logp if n=p^,mGlA, 
0 otherwise, 


can be expressed as 

(17) A(n) = ^Ai(rf)log(^) forn>l. 

d\n 


Since the proper prime powers contribute negligibly, the prime number theorem (4) 
can be rewritten as 


(18) i/-(x) := A(n) ~ a; as a; ^ oo. 

n<x 

It is natural to expect that the truncated sum 

(19) AK(n) := ^ ^((i)log(^) for n > 1. 

d\n 

d<R 


mimics the behaviour of A(n) on some averages. 

The beginning of our fine of research is Goldston’s [6] alternative rendering of 
the proof of Bombieri and Davenport’s theorem on small gaps between primes. 
Goldston replaced the application of the circle method in the original proof by the 
use of the truncated divisor sum (19). The use of functions like A}i{n) goes back 
to Selberg’s work [27] on the zeros of the Riemann zeta-function ({s). The most 
beneficial feature of the truncated divisor sums is that they can be used in place of 
A(n) on some occasions when it is not known how to work with A(n) itself. The 
principal such situation arises in counting the primes in tuples. Let 


(20) H = {hi,h 2 ,..., hk} with 1 < hi,..., hk < h distinct integers 

(the restriction of hi to positive integers is inessential; the whole set H can be 
shifted by a fixed integer with no effect on our procedure), and for a prime p denote 
by I'pi'H) the number of distinct residue classes modulo p occupied by the elements 
of H. The singular series associated with the /c-tuple H is defined as 

(21) 6(7f):=n(l--)-'^(l-^)- 

-‘■-‘-0 p 

p 

Since Vp{H) = k iov p > h, the product is convergent. The admissibility of hi is 
equivalent to S(7f) ^ 0, and to VpiH) ^ p for all primes. Hardy and Littlewood 
[23] conjectured that 

(22) ^ A(n;7f) := ^ A(n + /ii) • • • A(n + /ifc) = Af(6(7f) + o(l)), as N ^ oo. 

n<N n<N 
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The prime number theorem is the k = 1 case, and for k > 2 the conjecture remains 
unproved. (This conjecture is trivially true if Ti. is inadmissible). 

A simplified version of Goldston’s argument in [6] was given in [17] as follows. 
To obtain information on small gaps between primes, let 

(23) 

'ilj{n,h) :='ilj{n +h) -'tp{n,h) = ^ A(m), tpR{n,h) := ^ Ai?(m), 

n<m<n+/i. n<m<n+/i 

and consider the inequality 

(24) E > 0. 

N<n<2N 

The strength of this inequality depends on how well A/j(n) approximates A(n). On 
multiplying out the terms and using from [6] the formulas 

(25) 

^ AR{n)AR{n + k)&{{0,k})N, ^ A{n)Aji(n + k)&{{0,k})N (k ^ 0) 

n<N n<N 

(26) 

^ Aninf NlogR, ^ A(n)A/j(n) ~ A^logi?, 

n<N n<N 


valid for jfcj < R< (log A^) gives, taking h = X\ogN with A <C 1, 

(27) 

+ h) — ip{n)Y > (hN log R-\- Nh’^){\ — o(l)) > (^ + A^ — e)A^(log A^)^ 

N<n<2N 

(in obtaining this one needs the two-tuple case of Gallagher’s singular series average 
given in (46) below, which can be traced back to Hardy and Littlewood’s and 
Bombieri and Davenport’s work). If the interval (n,n -I- h\ never contains more 
than one prime, then the left-hand side of (27) is at most 

(28) logA^ + XNllogNf, 

N<n<2N 


which contradicts (27) if A > ^ and thus one obtains 


(29) 


liminf ^"+^ < i. 

n^co logPn 2 


Later on Goldston et al. in [3], [4], [7], [15], [16], [18] applied this lower-bound 
method to various problems concerning the distribution of primes and in [8] to the 
pair correlation of zeros of the Riemann zeta-function. In most of these works the 
more delicate divisor sum 

(30) AK(n) := J] dp{d) 

r<R ' d\{r,n) 


was employed especially because it led to better conditional results which depend 
on the Generalized Riemann Hypothesis. 

The left-hand side of (27) is the second moment for primes in short intervals. 
Gallagher [5] showed that the Hardy-Littlewood conjecture (22) implies that the 
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moments for primes in intervals of length ft, ~ A log N are the moments of a Poisson 
distribution with mean A. In particular, it is expected that 

(31) ^ {^{n h) — V’(n))^ ~ (A + A^)A^(log 

n<N 


which in view of (28) implies (10) but is probably very hard to prove. It is known 
from the work of Goldston and Montgomery [12] that assuming the Riemann Hy¬ 
pothesis, an extension of (31) for 1 < ft < N^~'^ is equivalent to a form of the pair 
correlation conjecture for the zeros of the Riemann zeta-function. We thus see that 
the factor ^ in (27) is what is lost from the truncation level R, and an obvious 
strategy is to try to improve on the range of R where (25)-(26) are valid. In fact, 
the asymptotics in (26) are known to hold for R < N (the first relation in (26) is 
a special case of a result of Graham [21]). It is easy to see that the second relation 
in (25) will hold with R = where a is the level of distribution of primes in 

arithmetic progressions. For the first relation in (25) however, one can prove the 
the formula is valid for R = for a small rj > 0, but unless one also assumes a 

somewhat unnatural level of distribution conjecture for A/j, one can go no further. 
Thus increasing the range of R in (25) is not currently possible. 

However, there is another possible approach motivated by Gallagher’s work [5]. 
In 1999 the first and third authors discovered how to calculate some of the higher 
moments of the short divisor sums (19) and (30). At first this was achieved through 
straightforward summation and only the triple correlations of A;j(n) were worked 
out in [17]. In applying these formulas, the idea of finding approximate moments 
with some expressions corresponding to (24) was eventually replaced with 

(32) {ip{n,h) - p\ogN){ij;ii{n,h) - C)'^ 

N<n<2N 


which if positive for some p > 1 implies that for some n we have i/’(n, ft) > 2 log A^. 
Here C is available to optimize the argument. Thus the problem was switched from 
trying to find a good fit for i/’(n, ft) with a short divisor sum approximation to the 
easier problem of trying to maximize a given quadratic form, or more generally 
a mollification problem. With just third correlations this resulted in (29), thus 
giving no improvement over Bombieri and Davenport’s result. Nevertheless the 
new method was not totally fruitless since it gave 


(33) 


,. . fPn+r-Pn , 

limini —^- < r -, 

n^oo logpn 2 


whereas the argument leading to (29) gives r — Independently of us, Sivak [29] 
incorporated Maier’s method into [17] and improved upon (33) by the factor e~'^ 
(cf. (6) and (14) ). 

Following [17], with considerable help from other mathematicians, in [20] the 
fc-level correlations of A/j(n) were calculated. This leap was achieved through re¬ 
placing straightforward summation with complex integration upon the use of Perron 
type formulae. Thus it became feasible to approximate A(n, H) which was defined 
in (22) by 


(34) 


A_R(n; H) := Ar^u + fti)Ai^(n + h 2 ) ■ ■ ■ AR{n + hk). 
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Writing 

(35) AK(n;H):=(logi?)'=-l«lAfl(n;H), ^kf{n,h)-= E Afl(n;H), 

where the distinct components of the fc-dimensional vector H are the elements of 
the set h) provided the approximation to ^/’(n, hy, and the expression 

k 

(36) {y{n,h)-p\ogN)CYajyR\n,h){logRy-y^ 

N<n<2N j=0 

could be evaluated. Here the aj are constants available to optimize the argument. 
The optimization turned out to be a rather complicated problem which will not be 
discussed here, but the solution was recently completed in [19] with the result that 
for any fixed A > (y/r — y^)^ and N sufficiently large, 

(37) ^ 1». E 1- 

n<N p<N 

p„+r-Pn<>'logp„ p:prime 

In particular, unconditionally, for any fixed rj > 0 and for all sufficiently large 
N > Nq^t]), a positive proportion of gaps Pn+i — Pn with Pn < N are smaller than 
(i + v) log A^. This is numerically a little short of Maier’s result (6), but (6) was 
shown to hold for a sparse sequence of gaps. The work [19] also turned out to be 
instrumental in Green and Tao’s [22] proof that the primes contain arbitrarily long 
arithmetic progressions. 

The efforts made in 2003 using divisor sums which are more complicated than 
Ai{(n) and A_R(n) gave rise to more difficult calculations and didn’t meet with 
success. During this work Granville and Soundararajan provided us with the idea 
that the method should be applied directly to individual tuples rather than sums 
over tuples which constitute approximations of moments. They replaced the earlier 
expressions with 

(38) Z (S A(n + hi) - r\og3N){Aii{n;'H))‘^, 

N<n<2N hiGTi 

where AR{n]Ti.) is a short divisor sum which should be large when is a prime 
tuple. This is the type of expression which is used in the proof of the result described 
in connection with (12)-(13) above. However, for obtaining the results (9)-(ll). 
we need arguments based on using (32) and (36). 

3. Detecting prime tuples 

We call the tuple (12) a prime tuple when all of its components are prime num¬ 
bers. Obviously this is equivalent to requiring that 

(39) Pnin) := (n -I- hi){n + / 12 ) ■ ■ ■ {n + hk) 

is a product of k primes. As the generalized von Mangoldt function 

(40) Afc(n) := Y h{d){log ^)'= 

d\n 
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vanishes when n has more than k distinct prime factors, we may use 

(«) ^ E M<i)(iogf)‘ 

APnin) 

d<R 

for approximating prime tuples. (Here l/k\ is just a normalization factor. That 

(41) will be also counting some tuples by including proper prime power factors 
doesn’t pose a threat since in our applications their contribution is negligible). But 
this idea by itself brings restricted progress: now the right-hand side of (6) can be 
replaced with 1 — 

The efficiency of the argument is greatly increased if instead of trying to in¬ 
clude tuples composed only of primes, one looks for tuples with primes in many 
components. So in [13] we employ 

(42) A«(n;H,£):=^^^ ^ 

d\P-H(n) 

d<R 

where \Ti\ = k and 0 < £ < k, and consider those Pn{n) which have at most k + £ 
distinct prime factors. In our applications the optimal order of magnitude of the 
integer £ turns out to be about \/k. To implement this new approximation in the 
skeleton of the argument, the quantities 

(43) ^ 

n<N 

and 

(44) ^ Aii{n]Hi,£i)Aiiin;H2,£2)0{n + ho), 

n<N 

are calculated as R, N ^ oo. The latter has three cases according as ho ^ H 1 UH 2 , 
or ho G TiiX ^ 2 , or ho £ Hi H H 2 - Here M = \Hi\ + \H 2 \ + £1 + £2 is taken as a 
fixed integer which may be arbitrarily large. The calculation of (43) is valid with R 
as large as and h < R'^ for any constant C > 0. The calculation of (44) can 

be carried out for R as large as and h < R. It should be noted that in [19] 

in the same context the usage of (34) which has k truncations, restricted the range 
of the divisors greatly, for then R < N^~'^ was needed. Moreover the calculations 
were more complicated compared to the present situation of dealing with only one 
truncation. 

Requiring the positivity of the quantity 

(45) 

2N 

E ( E 9{n + ho) - r\og?>N){ ^ Aii{n;H,£))'^, {h = Xlog3N), 

n=N+l l<ho<h T-(C{l,2,...,h} 

\H\=k 

which can be calculated easily from asymptotic formulas for (43) and (44), and 
Gallagher’s [5] result that with the notation of (20) for fixed k 

(46) E &{H) ~ as h ^ 00 , 

H 

yields the results (9)-(ll). For the proof of the result mentioned in connection with 
(12), the positivity of (38) with r = 1 and Aii{n;H,£) for an H satisfying (20) in 
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place of Aii{n;H) is used. For (13), the positivity of an optimal linear combination 
of the quantities for (12) is pursued. 

The proof of (15) in [14] also depends on the positivity of (45) for r = 1 and 
h = modihed with the extra restriction 

(47) {Pn{n), H P) = ^ 

p<-/loglV 

on the tuples to be summed over, but involves some essential differences from the 
procedure described above. Now the size of k is taken as large as (where 

c is a sufficiently small explicitly calculable absolute constant). This necessitates a 
much more refined treatment of the error terms arising in the argument, and in due 
course the restriction (47) is brought in to avoid the complications arising from the 
possibly irregular behaviour of VpiTi.) for small p. In the new argument a modified 
version of the Bombieri-Vinogradov theorem is needed. Roughly speaking, in the 
version developed for this purpose, compared to (7) the range of the moduli q is 
curtailed a little bit in return for a little stronger upper-bound. Moreover, instead 
of Gallagher’s result (46) which was for fixed k (though the result may hold for 
k growing as some function of h, we do not know exactly how large this function 
can be in addition to dealing with the problem of non-uniformity in k), the weaker 
property that is non-decreasing (apart from a factor of 1 -|-o(l)) as a 

function of k is proved and employed. The whole argument is designed to give the 
more general result which was mentioned after (15). 

4. Small gaps between almost primes 

In the context of our work by almost prime we mean an E 2 -number, i.e. a 
natural number which is a product of two distinct primes. We have been able to 
apply our methods to finding small gaps between almost primes in collaboration 
with S. W. Graham. For this purpose a Bombieri-Vinogradov type theorem for 
A * A is needed, and the work of Motohashi [26] on obtaining such a result for the 
Dirichlet convolution of two sequences is readily applicable (see also [1]). In [9] 
alternative proofs of some results of [13] such as (10) and (13) are given couched in 
the formalism of the Selberg sieve. Denoting by the n-th iii 2 -number, in [9] and 
[10] it is shown that there is a constant C such that for any positive integer r, 

(48) liminf(gr„+r - g„) < Cre’'; 

n—^oo 

in particular 

(49) liminf(( 7 „+i - g„) < 6. 

n—^oo 

Furthermore in [11] proofs of a strong form of the Erdos-Mirsky conjecture and 
related assertions have been obtained. 

5. Further remarks on the origin of our method 

In 1950 Selberg was working on applications of his sieve method to the twin prime 
and Goldbach problems and invented a weighted sieve method that gave results 
which were later superseded by other methods and thereafter largely neglected. 
Much later in 1991 Selberg published the details of this work in Volume II of his 
Gollected Works [28], describing it as “by now of historical interest only”. In 1997 
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Heath-Brown [24] generalized Selberg’s argument from the twin prime problem to 
the problem of almost prime tuples. Heath-Brown let 

k 

(50) Il = Y\_{ain + bi) 

i=l 

with certain natural conditions on the integers and bi. Then the argument of 
Selberg (for the case k = 2) and Heath-Brown for the general case is to choose 
p > 0 and the numbers Xd of the Selberg sieve so that, with r the divisor function, 

k 

(51) Q = '^{1-p'^T{a^n + bi)}C^Xdf > 0. 

n<x i—1 (i|n 


From this it follows that there is at least one value of n for which 

k 


(52) 


T{ain + bi) <-. 

^ P 


Selberg found in the case k = 2 that p = ^ is acceptable, which shows that one of 
n and n -I- 2 has at most two, while the other has at most three prime factors for 
infinitely many n. Remarkably, this is exactly the same type of tuple argument of 
Granville and Soundararajan which we have used, and the similarity doesn’t end 
here. Multiplying out, we have Q = Qi — pQ 2 where 

k 

(53) Ad)^ > 0, Q 2 = '^'^T{a^n + bi)}C^Xdf > 0. 

n<x d|n ^=1 n<x d|n 


The goal is now to pick Xd optimally. As usual, the Xd are first made 0 for d > R. 
At this point it appears difficult to hnd the exact solution to this problem. Further 
discussion of this may be found in [28] and [24]. Heath-Brown, desiring to keep Q 2 
small, made the choice 


(54) 


Xd = p{d){ 


log(R/d) fc+i 
logi? ’ ’ 


and with this choice we see 
(55) = 

^ ^ ‘ n<x 


Hence Heath-Brown used the approximation for a fc-tuple with at most fc -I- 1 
distinct prime factors. This observation was the starting point for our work with 
the approximation Adj(n;7i,^). The evaluation of Q 2 with its r weights is much 
harder to evaluate than Qi and requires Kloosterman sum estimates. The weight 
A in Q 2 in place of t requires essentially the same analysis as Qi if we use the 
Bombieri-Vinogradov theorem. Apparently these arguments were never viewed as 
directly applicable to primes themselves, and this connection was missed until now. 
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